Learning to See Physics via Visual De-animation
نویسندگان
چکیده
We introduce a paradigm for understanding physical scenes without human annotations. At the core of our system is a physical world representation that is first recovered by a perception module and then utilized by physics and graphics engines. During training, the perception module and the generative models learn by visual de-animation — interpreting and reconstructing the visual information stream. During testing, the system first recovers the physical world state, and then uses the generative models for reasoning and future prediction. Even more so than forward simulation, inverting a physics or graphics engine is a computationally hard problem; we overcome this challenge by using a convolutional inversion network. Our system quickly recognizes the physical world state from appearance and motion cues, and has the flexibility to incorporate both differentiable and non-differentiable physics and graphics engines. We evaluate our system on both synthetic and real datasets involving multiple physical scenes, and demonstrate that our system performs well on both physical state estimation and reasoning problems. We further show that the knowledge learned on the synthetic dataset generalizes to constrained real images.
منابع مشابه
Machine learning based Visual Evoked Potential (VEP) Signals Recognition
Introduction: Visual evoked potentials contain certain diagnostic information which have proved to be of importance in the visual systems functional integrity. Due to substantial decrease of amplitude in extra macular stimulation in commonly used pattern VEPs, differentiating normal and abnormal signals can prove to be quite an obstacle. Due to developments of use of machine l...
متن کاملImproved Effectiveness of Cueing by Self-Explanations when Learning from a Complex Animation
A major problem in learning from instructional animations is that the complex perceptual and cognitive processing exceeds the learner’s limited processing capacities. Although attention cueing might help learners in focusing on essential parts of an animation, previous studies have shown that it does not necessarily improve learning performance. This study investigated whether generating self-e...
متن کاملRainbow of Translation: A semiotic approach to intercultural transfer of colors in children's picture books
Abstract The aim of intercultural translation is to communicate. Communication is acted via verbal as well as visual means. The interaction of verbal and visual means of communication makes a set of complex situations which demand special attention in translation. One context in which the interaction of visual and verbal elements gets vital importance is children’s picture books. Color is an in...
متن کاملRainbow of Translation: A semiotic approach to intercultural transfer of colors in children's picture books
Abstract The aim of intercultural translation is to communicate. Communication is acted via verbal as well as visual means. The interaction of verbal and visual means of communication makes a set of complex situations which demand special attention in translation. One context in which the interaction of visual and verbal elements gets vital importance is children’s picture books. Color is an in...
متن کاملThree-Dimensional Anatomy of Human Body, With Animation, for Medical Training
Every day, surgeons operate on thousands of patients around the country. For each operation, the surgeon and support staff have trained in some way to perform the delicate surgical procedures, some of them training on cadavers in medical school and others learning by doing. For each operation, the patient has gone through a learning experience as well, via conversations with doctors and nurses,...
متن کامل